The Bimodal Distribution of Genic GC Content Is Ancestral to Monocot Species
Abstract
In grasses such as rice or maize, the distribution of genic GC content is well known to be bimodal. It is mainly driven by GC content at third codon positions (GC3 for short). This feature is thought to be specific to grasses, as closely related species such as banana have a unimodal GC3 distribution. GC3 is associated with numerous genomic features, and uncovering the origin of this peculiar distribution will help in understanding the potential roles and consequences of GC3 variations within and between genomes. Until recently, the origin of the peculiar GC3 distribution in grasses remained unknown. Thanks to the recent publication of several complete genomes and transcriptomes of nongrass monocots, we studied more than 1,000 groups of one-to-one orthologous genes in seven grasses and three outgroup species (banana, palm tree, and yam). Using a maximum likelihood-based method, we reconstructed GC3 at several ancestral nodes. We found that the bimodal GC3 distribution observed in extant grasses is ancestral to both grasses and most monocot species, and that the other species studied here have lost this peculiar structure. We also found that GC3 in grass lineages is globally evolving very slowly and that the decreasing GC3 gradient observed from 5' to 3' along coding sequences is also conserved and ancestral to monocots. This result strongly challenges previous views on the specificity of grass genomes, and we discuss its implications for the possible causes of the evolution of GC content in monocots.
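As an illustration of the quantity the abstract calls GC3, the sketch below computes the fraction of G or C bases at third codon positions of a single coding sequence. It is a minimal example, not the authors' pipeline: the function name `gc3`, the assumption that the sequence starts in frame, and the choice to skip ambiguous bases are all assumptions made here for illustration.

```python
def gc3(cds: str) -> float:
    """Fraction of G or C at third codon positions of a coding sequence.

    Assumes (hypothetically) that `cds` starts in frame at codon position 1.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    # Every third base, starting at index 2, is a third codon position.
    # Ambiguous symbols such as N are skipped rather than counted as AT.
    thirds = [base for base in seq[2:usable:3] if base in "ACGT"]
    if not thirds:
        raise ValueError("no unambiguous third codon positions")
    return sum(base in "GC" for base in thirds) / len(thirds)


if __name__ == "__main__":
    # Codons ATG, GCC, AAA have third positions G, C, A.
    print(round(gc3("ATGGCCAAA"), 3))
```

Applying such a function to every gene in a genome and plotting a histogram of the values would show whether the distribution is unimodal or bimodal.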
Similar resources
Cross-Species Analysis of Genic GC3 Content and DNA Methylation Patterns
The GC content in the third codon position (GC(3)) exhibits a unimodal distribution in many plant and animal genomes. Interestingly, grasses and homeotherm vertebrates exhibit a unique bimodal distribution. High GC(3) was previously found to be associated with variable expression, higher frequency of upstream TATA boxes, and an increase of GC(3) from 5' to 3'. Moreover, GC(3)-rich genes are pre...
Patterns and evolution of nucleotide landscapes in seed plants.
Nucleotide landscapes, which are the way base composition is distributed along a genome, strongly vary among species. The underlying causes of these variations have been much debated. Though mutational bias and selection were initially invoked, GC-biased gene conversion (gBGC), a recombination-associated process favoring the G and C over A and T bases, is increasingly recognized as a major fact...
Evolution of recombination and genome structure in eusocial insects
Eusocial Hymenoptera, such as the European honey bee, Apis mellifera, have the highest recombination rates of multicellular animals.(1) Recently, we showed(2) that a side-effect of recombination in the honey bee, GC biased gene conversion (bGC), helps maintain the unusual bimodal GC-content distribution of the bee genome by increasing GC-content in high recombination areas while low recombinati...
On a New Bimodal Normal Family
Unimodal distributions are frequently used in theoretical statistical studies. But in applied statistics, there are many situations in which unimodal distributions cannot be fitted to the data. For example, the distribution of the data outside the control zone in quality control, or of outlier observations in linear models and time series, may need to be bimodal. These situations, oc...
A unique set of 11,008 onion expressed sequence tags reveals expressed sequence and genomic differences between the monocot orders Asparagales and Poales.
Enormous genomic resources have been developed for plants in the monocot order Poales; however, it is not clear how representative the Poales are for the monocots as a whole. The Asparagales are a monophyletic order sister to the lineage carrying the Poales and possess economically important plants such as asparagus, garlic, and onion. To assess the genomic differences between the Asparagales a...